(54) Title: TREATMENT OF CANCER

(57) Abstract: The product of a gene located at human chromosome 21q11-21 has ubiquitin specific protease activity and binds
to ubiquitin, polyubiquitin, ubiquitin-like protein SUMO-3, and proteins which are implicated in the repair/excision of DNA. It is
believed to have a role in regulating cell growth, cell growth arrest and/or apoptosis and is implicated in carcinoma growth. The
gene has been sequenced and cloned into a microorganism and the product's effect on cell growth is to be investigated. The gene
has GenBank accession number AF134213 and has the HUGO-approved name USP25.
Treatment of Cancer
The present invention relates to the use of the product of the USP25
gene, which appears to have ubiquitin specific protease activity and may
have protease activity on ubiquitin-like proteins, located at human
chromosome 21q11, in the treatment, prophylaxis or diagnosis of cancer,
especially solid tumours, more particularly lung cancer.

Ubiquitin mediated proteolysis by the 26S proteasome is responsible 
for the physiological regulation of the levels of many proteins which are key 
cell cycle and cell growth regulators (Hochstrasser 1995), as well as known
tumour suppressors (Lane 1998).

Ubiquitin is a 76 amino acid polypeptide present in all eukaryotic
cells, and highly conserved in evolution. Ubiquitin is conjugated to a target
protein through an isopeptide bond between the C-terminus of ubiquitin and
the ε-amino group of a Lys in the target protein (ubiquitination), a process
mediated by three groups of enzymes: ubiquitin activating enzymes (E1),
ubiquitin conjugating enzymes (E2) and ubiquitin ligases (E3). Ubiquitinated
proteins exist in a monoubiquitinated form or carry a multiubiquitin chain:
the former is not a degradation signal, while the latter, a Lys-48-linked
ubiquitin-ubiquitin(n) conjugate, works as a strong degradation signal when
joined to a Lys in a target protein. Protein conjugated to polyubiquitin is
then very rapidly and efficiently degraded by non-lysosomal, ATP-dependent
degradation by the 26S proteasome. The de-ubiquitinating enzymes (DUB's)
broadly fall into two classes, ubiquitin specific proteases (USPs) and
ubiquitin-C-terminal hydrolases (UCHs), which are each capable of
de-conjugating the ubiquitin-ubiquitin and ubiquitin-protein links, thereby
converting polyubiquitin into mono-ubiquitin, and de-coupling
(deubiquitinating) ubiquitin from the target protein, with the result of
preventing the degradation of the target protein.

The class of UCH enzymes tends to include relatively small proteins
(about 35-40 kD) which have low specificity for ubiquitinated proteins. BAP1 is
a UCH but is unusual in that it is larger, having a weight of nearly 100 kD.

Several such enzymes are known, the sequences of which show some
sequence homologies especially in two domains, the Cys and His domains.

Very broadly, two main functions have been observed among the
various members of the USP (UBP) superfamily (Wilkinson 1997, Wilkinson
and Hochstrasser 1998). The first is the generation of free ubiquitin from
precursor fusion proteins or from peptide-linked polyubiquitin after
proteolysis of the targeted protein, and the second is de-ubiquitination.
When a protein is targeted for ubiquitin mediated degradation, it is linked to
ubiquitin via an isopeptide bond between the C-terminus of ubiquitin and a
lysine ε-amino group(s) of the acceptor protein. Once the conjugate is
formed, it can have only two fates: non-lysosomal proteolysis mediated by
the 26S proteasome resulting in total protein degradation, or de-conjugation
from ubiquitin (de-ubiquitination), resulting in the rescue of the target protein
from degradation (Wilkinson 1997, Wilkinson and Hochstrasser 1998).

The regulation of p53 levels by USP25, and consequential prescribed
increased cellular predisposition to apoptosis (programmed cell death) upon
overexpression of USP25, could be a mechanism by which neuronal loss
occurs in Alzheimer's Disease associated with Down Syndrome. The same
mechanism could be invoked in explaining sporadic AD if overexpression
of USP25 occurs through a somatic change in aging neurons.

The potential USP catalysed de-ubiquitination and rescue from
degradation of tumour suppressors such as p53 could perhaps explain the
link to deletions of USPs in solid tumours. De-ubiquitination could play a
major role in the Mdm2 mediated control of p53 levels and its activation
mechanism, since the ubiquitin-mediated proteasome degradation of p53 is
an important effector arm of this recently revealed control pathway (Haupt et
al., 1997, Kubbutat et al., 1997, Lane 1998). Halving the dose of a USP
could give a pre-cancerous cell a selective advantage in proliferation, by
diminishing its rate of "re-cycling" of p53, making it more difficult to achieve a
threshold concentration of p53 necessary for its activation. The same
hypothetical, gene-dose regulated, tumour suppression model could also
explain why trisomy 21 (Down syndrome, DS) seems to confer protection

cancers have very rarely been reported, and are significantly under-
represented compared to the age-matched euploid population (Oster et al.,
1975, Satge et al., 1998).

Baker et al. 1999 also propose a role for USPs in tumour formation
and cell growth.

[...] polypeptide tags [...] ubiquitin and have [...]
levels of identity and similarity (determined using the BLAST algorithm for
instance) to ubiquitin itself. There is a recognised superfamily of such
[...] 1997, Schwarz et al. 1998). The yeast Smt3 and human SUMO-1 (PIC1,
Sentrin, hSmt3C), SUMO-2 (hSmt3A) and SUMO-3 (hSMT3B) belong to the
same family of UbL proteins with approximately 50% identity between
[...] acid [...]
1998, Saitoh and Hinchey 2000). Yeast and human UBC9 are capable of
[...] yeast or human UbL's, but not ubiquitin (Schwarz et al.
1998). The SUMO-1, -2 and -3 have the C-terminal glycine, necessary for
[...] of the target protein lysine residue, but unlike ubiquitin do
[...] of polyubiquitin
[...] isopeptide bonds, which are the signal for the proteasome
degradation (Saitoh and Hinchey 2000). Nevertheless, yeast Smt3 protein
can rescue the mutant Mif2 phenotype, a deficient centromere binding
[...] (Meluh and Koshland
1995). SUMO-1, as well as SUMO-3 (and probably also SUMO-2) are all
capable of being attached by UBC9 to RanGap1, a Ran GTP-ase activating
[...] (Kamitani [...]). This ATP-dependent [...]
for the binding between modified RanGap1 and RanBP2 binding protein, in


WO 00/79267 PCT/GB00/02446

order to form functional nuclear pore complex, which controls export and
import of molecules through the nuclear envelope (Mahajan et al. 1997,
Matunis et al. 1998, Lee et al. 1998). In addition, UbL small proteins have
been shown to modify the death domains of Fas (Okura et al. 1996), tumour
necrosis factor receptor 1 (Okura et al. 1996), PML (a tumour suppressor
implicated in the pathogenesis of acute promyelocytic leukaemia) (Kamitani
et al. 1998b) and Rad51/52 DNA repair proteins (Shen et al. 1996a). Their
conjugating enzyme, UBC9, has been shown to interact by Y2H technique
with RAD51/52 DNA repair proteins, and the master tumour suppressor p53
(Shen et al. 1996b). Another UbL is NEDD8 (Kamitani et al. 1997).

UbL's are conjugated and cleared from their targets by enzymes.
Several UbL hydrolase enzymes have been identified which convert
precursor UbL to active UbL. Some such enzymes interact with ubiquitin
itself as well as with other UbL's. Proteases involved in cleavage of
conjugates of UbL with target protein have been identified, for instance
SENP1 and SUSP-1, which were recently cloned (Kim et al. 2000, Gong et
al. 2000a), and found to specifically cleave SUMO-1, -2 and -3, but not
ubiquitin and NEDD8. The first human enzyme with classical USP structure
(Cys, His domains) for which dual specificity to both ubiquitin and
ubiquitin-like protein was demonstrated was the very recently published
USP21 on chromosome 1q21 (Gong et al. 2000b). However, in contrast to
SENP1 and SUSP-1, this enzyme cleaves ubiquitin and NEDD8, but not
SUMO-1, -2 or -3 (Gong et al. 2000b).

The proximal third of the chromosome 21 long arm is an exceptionally
gene-poor region of the human genome as estimated by a number of criteria
(Shimizu et al., 1995, Yaspo et al., 1995, Gardiner 1996); the estimates of
gene-density range from one gene in a megabase to one gene in six
megabases of genomic DNA. Until recently only three full length genes had
been mapped in this region: STCH (a member of the hsp70 family) (Brodsky
et al., 1995), RIP140 (a protein functionally interacting with a variety of nuclear
receptors such as estrogen receptor) (Cavailles et al., 1995) and ANA (a
member of the Tob/BTG1 family of tumour suppressors) (Kohno et al.,
1998). This region is also an example of extremely highly methylated regions
in the human genome.

Groet et al. (1998) describe a high-resolution bacterial contig map of
3.4 Mb of genomic DNA in human chromosome 21q11-q21, encompassing
the region of elevated disomic homozygosity in Down's syndrome-
associated abnormal myelopoiesis and leukemia, and which has shown a
strong association with Alzheimer's disease (AD). It was suggested that the
high resolution bacterial clone overlap map should be the basis for deriving
a more complete transcriptional map of that region of the chromosome. It
was hoped that this would lead to an explanation of the chromosome 21q11
linkage in familial early onset AD (FEOAD) families. In particular it was
suggested that a modifier gene in that region could act together with the
presenilin-1 gene to generate or modify the AD phenotype.
Further work by the present inventors has revealed a new gene in the
proximal third of chromosome 21 and that the product of this gene has
ubiquitin specific protease properties. It is postulated that the gene product
and USPs generally may have a role in AD. The work has been published in
Groet et al (2000) after the first priority date of the present application.

Valero et al. (1999), published after the first priority date of the
present application, have, in parallel, identified this gene and pointed out the
gene product's sequence homologies to known USP's in the conserved
peptide domains previously identified e.g. by d'Andrea et al (1998). They
postulate a role in Alzheimer's disease. This protein has the HUGO
approved name USP25.

According to the present invention there is provided the new use of
the product of the USP25 gene located at human chromosome 21q11-21, a
non-human homologue thereof or a functional fragment thereof in the
manufacture of a medicament for use in the diagnosis, prophylaxis or
treatment of cancer.

The cancer is usually a solid cancer, most often non-small cell lung
carcinoma, or skin cancer.

A functional fragment herein means a protein having ubiquitin specific
protease activity or UbL specific protease activity and comprising a portion
with sequence homology with the product of the said USP25 gene.

It is believed that the ubiquitin specific protease activity of the protein
having sequence ID 1 is responsible for its implication in the pathogenesis of
cancer. USP activity may be determined using the technique described in
the Examples below, which uses bacteria cotransformed with the USP
gene and with a reporter gene encoding a fusion protein which is a ubiquitin-
conjugated detectable protein. The protein may be an enzyme detectable by
direct enzyme reaction, by enzyme-linked immune assay techniques,
autoradiographically or by direct staining after gel separation under
conditions suitable to separate ubiquitin and cleaved protein from fusion
protein.

From experiments conducted to determine with which proteins the
product having sequence ID 1 interacts, we have found that there is
interaction with ubiquitin, polyubiquitin and various ubiquitin precursors, as
well as HHR23A (Matsutani et al 1994, GenBank Accession No. D21235).
There is also interaction with other ubiquitin-like proteins and with proteins
which are known to interact with ubiquitin-like proteins, such as SUMO-3
(Mannen et al 1996, Kamitani et al 1996, Saitoh et al 2000, GenBank
Accession No. NM_006937) and ubiquitin-like-specific conjugating enzyme 9
(Schwarz et al 1998, Lee et al 1998, GenBank Accession No. U66867). The
isolated protein may therefore be characterised further by having a positive
interaction in a yeast-two-hybrid procedure with one or more, preferably all
three, proteins having the sequences of GenBank Accession Nos. D21235,
NM_006937 and U66867. Ubiquitin-like specific protease activity may be
determined using techniques analogous to those used to determine ubiquitin
specific protease activity, by using a substrate which is a fusion protein of
the ubiquitin-like protein of interest and a detectable protein, and using the
usual separation and immune based or autoradiographic identification
techniques.

The gene may be transcribed and translated within a cell line selected
as positive for the native gene or an active (in terms of ubiquitin or UbL
specific protease activity) fragment thereof. The gene has preferably been
introduced into a microorganism or a cell line in a form in which it can be
transcribed and translated, and the microorganism or the cell line, as the
case may be, has been cultured under conditions whereby the gene is
replicated during cell division, and is transcribed and translated into ubiquitin
specific protease, and the USP is recovered. Preferably the gene in the
microorganism is recombinant DNA derived from the mRNA from cells
having the active USP gene. Suitably the gene includes sequence ID No. 7,
more preferably sequence ID No. 6.

According to a further aspect of the invention there is provided a new
use of a protein product having Cys, QQD and His domains specified in
sequence IDs numbers 2, 3 and 4, respectively, in the manufacture of a
medicament for use in the diagnosis, prophylaxis or treatment of cancer.
The protein has ubiquitin specific protease or ubiquitin-like specific protease
activity, and has the three specified domains in the USP or UbLSP active
conformation. Preferably the protein has, outside the specified domains,
some level of sequence homology with sequence ID 1, for instance at least
20%, preferably at least 50%, identity with that sequence, and a level of
similarity of at least 50%, preferably at least 70% or more with that sequence
(in each case determined using, for instance the BLAST algorithm).
Preferably the protein has sequence I.D.1 or sequence I.D.5. Homologues,
such as the corresponding mouse product, described in Valero et al 1999,
may be used or sequences which have the above levels of identity and
similarity with such a sequence.
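
The identity and similarity thresholds stated above can be illustrated with a minimal sketch. It is not the BLAST algorithm: it assumes two sequences that have already been aligned (gaps written as "-"), and the grouping of "similar" amino acids is a simplified assumption made purely for illustration. The function names and the example sequences are hypothetical.

```python
# Toy illustration only: BLAST reports identity and similarity itself.
# The conservative-substitution groups below are an illustrative assumption,
# not the scoring matrix used by BLAST.
SIMILAR_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("ST"), set("NQ"), set("AG"),
]


def identity_and_similarity(aligned_a, aligned_b):
    """Return (% identity, % similarity) over the ungapped aligned columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1
    if compared == 0:
        return 0.0, 0.0
    return 100.0 * identical / compared, 100.0 * similar / compared


def meets_thresholds(identity, similarity, min_identity=20.0, min_similarity=50.0):
    """Check the lower limits named in the text (20% identity, 50% similarity)."""
    return identity >= min_identity and similarity >= min_similarity


# Hypothetical four-residue alignment: three identical columns, one D/E swap.
ident, simil = identity_and_similarity("ACDE", "ACEE")
print(ident, simil, meets_thresholds(ident, simil))
```

Passing `min_identity=50.0` and `min_similarity=70.0` would test the preferred levels instead of the minimum ones.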

According to a preferred embodiment the protein has sequence I.D.1.
According to another preferred embodiment the protein has sequence
According to a further aspect of the invention there is provided an in
vitro method in which mammalian cells are cultured in the presence of the
product of the USP25 gene located at human chromosome 21q11 or of a
homologue thereof, or a functional fragment of said product.

According to a further aspect of the invention there is provided an in
vitro method in which mammalian cells are cultured in the presence of a
protein product having Cys, QQD and His domains specified in sequence
IDs numbers 2, 3 and 4, respectively. Preferably the protein has, outside the
specified domains, some level of sequence homology with sequence ID 1,
for instance at least 20%, preferably at least 50%, identity with that
sequence, and a level of similarity of at least 50%, preferably at least 70% or
more with that sequence (in each case determined using, for instance the
BLAST algorithm). Preferably the protein has sequence I.D.1 or sequence
I.D.5. Homologues, such as the corresponding mouse product, described in
Valero et al 1999, may be used or sequences which have the above levels of
identity and similarity with such a sequence.

In these in vitro methods the effect of the protein on cell growth, cell 
growth arrest and/or apoptosis is assessed. 

The microorganism containing the specified gene has preferably been
transformed with a vector comprising at least a portion of sequence I.D.6
comprising residues 199 to 3367, optionally including an additional 96 b.p.
exon inserted after base 2356, and optionally including 5' and/or 3'
untranslated regions (UTR's). The vector preferably comprises
transcriptional and translational control sequences. The microorganism may
be a yeast but is most conveniently a bacterium.

A mammalian cell transfected with a vector preferably comprises 
exons making up a sequence comprising residues 199 to 3367 of sequence 
I.D.6 and optionally also regulatory genomic sequences which are present in 
wild-type chromosome which lead to enhanced expression activity. 
The sequence from residues 199 to 3367 of sequence ID no 6 is
specified as sequence ID7 and consists of start codon (ATG), an open
reading frame of 3165 nucleotides, and a stop codon (TAA).
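
The structure just described (start codon, open reading frame, stop codon) can be checked mechanically. The following is a minimal sketch under stated assumptions: the function name `describe_orf` is hypothetical, the start codon is counted as part of the coding region, and the nine-nucleotide test sequence is invented for illustration and is not part of any sequence of this application.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def describe_orf(seq):
    """Validate a DNA string as start codon + in-frame codons + stop codon.

    Returns the number of coding nucleotides (start codon included, stop
    codon excluded), the number of encoded amino acids and the stop codon.
    """
    seq = seq.upper()
    if len(seq) < 6 or len(seq) % 3 != 0:
        raise ValueError("sequence must contain whole codons, start to stop")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] != "ATG":
        raise ValueError("sequence does not begin with the ATG start codon")
    if codons[-1] not in STOP_CODONS:
        raise ValueError("sequence does not end with a stop codon")
    if any(codon in STOP_CODONS for codon in codons[1:-1]):
        raise ValueError("premature in-frame stop codon")
    return {
        "coding_nt": len(seq) - 3,
        "amino_acids": len(codons) - 1,
        "stop": codons[-1],
    }


# Hypothetical toy sequence: Met-Ala followed by the TAA stop codon.
print(describe_orf("ATGGCTTAA"))

# An ORF of 3165 nucleotides corresponds to 3165 / 3 = 1055 codons.
print(3165 // 3, 3165 % 3)
```

The last line reflects the arithmetic relating an open reading frame of 3165 nucleotides to a protein of 1055 amino acids.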

According to a further aspect of the invention there is provided a new
use of DNA including the gene located at human chromosome 21q11-21 or
a fragment thereof encoding a functional USP product in the manufacture of
a medicament for use in the prophylaxis or treatment of cancer. The DNA is
preferably incorporated in a gene therapy vehicle, and is preferably part of a
vector, for instance a plasmid vector, or viral vector, capable of transfecting
cells in vivo. The medicament usually includes a pharmaceutically
acceptable carrier.



The invention is illustrated further in the accompanying drawings in
which:

Figure 1 represents a map of human chromosome 21;
Figure 2 represents a map showing the location of exon trapped
products from the experiments reported below;
Figure 3 represents sequence homologies of USPs; and
Figure 4 represents the results of the experiment illustrating USP
properties reported below.

The following specific description describes the work which has been
carried out.

Experimental 

We identified a portion of human chromosome 21 homozygously
deleted in non-small cell lung carcinoma (NSCLC) for further study. The
region contained the DNA marker with the highest NSCLC-associated loss of
heterozygosity (LOH), reported by Kohno et al. We found a shared region of
overlap (SRO) for the hemizygous loss in other NSCLC. The current work is
to identify genes in the SRO which have a potential role in tumour
suppression.
A total of 42 fresh NSCLC cases have been analyzed from the
Croatian Tumour Bank (CTB), an initiative with set rules and criteria for
accumulation of fresh clinical tumor specimens for molecular studies
(Spaventi et al., 1994). Of these, half (including tumors #47 and #61) were
samples that were recently studied for LOH of the NM23-H1 gene (Bosnar et
al., 1997), and the other half were fresh tumors (data obtainable from CTB,
which also lists the tumor stage in the TNM system, grade, size and survival
data). In each case, the tumors and normal lung tissue specimens (as
evaluated by the surgeon) were frozen in liquid nitrogen in the operating
room and further stored at -70°C. Genomic DNA was isolated using standard
procedures (Sambrook et al., 1989). For each sample, 4 µm serial frozen
sections were cut, mounted on glass slides, and stained with hematoxylin-
eosin (H&E). A pathologist confirmed the histologic type of the tumor and
evaluated the percentage of normal cells within the tumor. Only samples
with less than 20% non-tumor cells were used in this study.
Markers and LOH Analysis 

Microsatellite analysis was performed using polymerase chain
reaction (PCR) with appropriate primer pairs (sequences and PCR
conditions as in Genome Data Base, Johns Hopkins University, Baltimore,
MD), where the forward primer only from each pair was 5' fluorescently
labeled with Applied Biosystems (ABI; Foster City, CA) Big Dyes™ (6-FAM
or HEX). Amplification products were analysed using an ABI 310 Genetic
Analyzer. Size standards (GeneScan 350) were mixed with every sample for
accurate sizing; the separation of the mixture of denatured fragments was
achieved by electrophoresis through a 47 cm capillary (module GS STR
POP4 C) for approximately 30 min. Raw data were analyzed using
GeneScan and Genotyper software. LOH ratios were calculated exactly as
described in the GeneScan Applications manual provided by ABI. For each
individual allele's fluorescence level, an average of 3 independent
electrophoresis-analysis cycles on the ABI 310 was used for calculation.
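
The calculation itself is given only by reference to the ABI manual. As a hedged sketch, the widely used allelic imbalance ratio (tumour allele 1 / tumour allele 2) divided by (normal allele 1 / normal allele 2) is assumed here; it may differ in detail from the manual's procedure. The function name and all peak values are hypothetical, and each allele's fluorescence level is averaged over three runs as described above.

```python
from statistics import mean


def loh_ratio(normal_a1, normal_a2, tumor_a1, tumor_a2):
    """Allelic imbalance ratio from replicate fluorescence readings.

    Each argument is a list of peak values for one allele, one value per
    independent electrophoresis-analysis cycle. The ratio formula is an
    assumption for illustration, not a quotation from the ABI manual.
    """
    n1, n2 = mean(normal_a1), mean(normal_a2)
    t1, t2 = mean(tumor_a1), mean(tumor_a2)
    return (t1 / t2) / (n1 / n2)


# Hypothetical readings: allele 1 is halved in the tumour relative to normal.
ratio = loh_ratio(
    normal_a1=[1000, 1100, 900],
    normal_a2=[1000, 1000, 1000],
    tumor_a1=[500, 450, 550],
    tumor_a2=[1000, 1000, 1000],
)
print(ratio)
```

A ratio near 1 indicates balanced alleles in the tumour, while a ratio far from 1 suggests loss of one allele. Any cut-off used to call LOH would have to be taken from the manual and is deliberately not fixed here.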
Fluorescence In Situ Hybridization (FISH)

Unstained 4 µm-thick paraffin block sections were fixed to glass slides
and a standard pretreatment protocol was followed for formalin-fixed
paraffin-embedded slides. P1-derived artificial chromosome (PAC) DNA was
labelled with digoxygenin-11-dUTP (Boehringer Mannheim, Germany).
Approximately 0.5 µg of each labelled PAC DNA sample was mixed with 5
µg of Cot-1 DNA (Gibco BRL, Gaithersburg, MD), precipitated, denatured,
allowed to preanneal, and then applied to a denatured slide and hybridized
overnight. Slides were washed and signal detected using anti-digoxygenin-
rhodamine, followed by DAPI counterstain. Images were captured using a
Zeiss Axioskop microscope equipped with a charge-coupled device (CCD;
Photometrics, Tucson, AZ) connected to an Apple Powermac 8100
computer. Images were captured on 3 levels of focus, and each level was
examined for signals using SmartCapture software (Vysis, Inc., Chicago, IL).
Only nuclei with signals were counted in each level, and the number of
signals in each cell was determined.
B: FISH using a pool of PACs 90B5, 126N20 and BAC 391 12 as a probe on
the paraffin embedded sections of the tumour #61. Two signal nuclei are
predominant.
C: FISH using a pool of PACs 73M5 and 135E14 as a probe on the paraffin
embedded sections of the tumour #61. Single signal nuclei are predominant.
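
The counting rule described above (only nuclei with signals are counted, and the number of signals per nucleus is recorded) can be sketched as a simple tally. The function name and the counts are hypothetical and are not data from the experiments.

```python
from collections import Counter


def predominant_signal_count(signals_per_nucleus):
    """Tally nuclei by number of FISH signals, ignoring nuclei without signal.

    Returns the most frequent signal count (or None if no nucleus has a
    signal) together with the full tally.
    """
    tally = Counter(n for n in signals_per_nucleus if n > 0)
    if not tally:
        return None, tally
    predominant, _ = tally.most_common(1)[0]
    return predominant, tally


# Hypothetical counts for one section: mostly single-signal nuclei.
predominant, tally = predominant_signal_count([1, 1, 2, 1, 0, 1, 2])
print(predominant, dict(tally))
```

A predominance of single-signal nuclei for a given probe pool is what would be read as hemizygous loss of that region, whereas two-signal nuclei indicate retention of both copies.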
Northern Blot Analysis 

The cDNAs were labelled by random priming and hybridized to human
multiple tissue Northern blots (Clontech, Palo Alto, CA) containing 2 µg
polyA+ RNA per lane using the protocol recommended by the manufacturer.
The exposure was for 14 hr to Molecular Dynamics (Sunnyvale, CA)
Phosphorimager screens. The I.M.A.G.E. Consortium (Lennon et al 1996)
cDNA clone ID 824710 and the Unigene clone A002B43 have been used as
labelled probes in separate experiments.
Cleavage Analysis of Ubiquitin-Met-β-Galactosidase Fusion Protein

This analysis was performed essentially as described (Everett et al.,
1997). Model fusion protein ubiquitin-Met-β-galactosidase in a pACYC184
(Cm^r replicon) was represented by the plasmid pACYC-Ub-Met-β-gal, a kind
gift of R. Everett. Plasmid pRB105 containing a Saccharomyces cerevisiae
ubiquitin-specific protease UBP2 in an IPTG-inducible pBR322 (Amp^r
replicon) was a kind gift of R. Baker, and was used as a positive control.
The new gene USP25 was cloned from nucleotide position 203 to nucleotide
position 3367 (numbering as in GenBank AF134213) into SacI/SalI cloning
sites of the IPTG-inducible Escherichia coli expression vector pQE30
(Qiagen, Chatsworth, CA). The E. coli XL-1 blue cells were transformed
using a standard rubidium chloride-heat shock method with the combination
of pACYC-Ub-Met-β-gal and either pQE30 vector, pQE30-USP25, or
pRB105, and each of the 3 cotransformants was selected on medium
containing chloramphenicol (42 µg/ml) and carbenicillin (75 µg/ml). Western
blots were prepared by electrotransfer to a nitrocellulose membrane
(Schleicher & Schuell, Keene, NH). The β-galactosidase-containing bands
were detected by an anti-β-galactosidase polyclonal rabbit antiserum (a kind
gift of R. Everett) using an enhanced chemiluminescence (ECL) assay kit
(RPN 2132; Amersham, Arlington Heights, IL) under conditions recommended
by the manufacturer.

Identification and cloning of USP25 

Twelve sequenced exon-trapped products, when analysed using
BLAST-N against public sequence databases, revealed clusters of
overlapping cDNA clones. Sequences of our exon-trapped products matched
exactly the sequences of the cDNAs forming contigs with a large open
reading frame (ORF). In three cases (see Fig.2): from EST 824710 to
AA209364, from AA307805 to AA081200 and from N92952r to Z45010, our
trapped exon sequences served to bridge the gaps in the gene sequence
The combined sequence (sequence ID no 5 and GenBank accession
number AF134213) revealed a 199 bp 5'UTR, start codon, an ORF of 3165
nucleotides encoding a protein of 1055 amino acids, a stop codon, a 3'UTR
of [...] nucleotides and a polyadenylation signal. The total length [...]
(including the [...]) [...] Human lung tissue, [...] a varying intensity, it is
most prominent in [...] hybridising transcript of [...] kb. All tissues also
show a larger weaker band of [...] kb, which could be due to an alternative
polyadenylation site.

PACs (73M5 and 135E14) became publicly available by the German Human
Genome [...] consortium (EMBL accession numbers [...] and [...]).
Comparison of the genomic sequence with the overlapping cDNA clones and
exon sequences revealed that 12 out of the 24 exons had been trapped
(hatched rectangles in Fig.2). It also became apparent [...] the known
chromosome 21 CpG island at D21S382 (also known as LL56 NotI linking
clone on the NotI physical map of 21q, Ichikawa et al. 1993).

When the deduced polypeptide sequence was compared to Swissprot
and other public databases using BLAST-P, a clear pattern of significant
homologies to [...] eukaryotes was found (Fig.4): all of these proteins
belonged to the superfamily of ubiquitin specific proteases (USP-s) or
ubiquitin carboxy-terminal hydrolases (UCH-s) (Baker et al., 1992, Swanson
et al., 1996, Everett et al., 1997, Wilkinson 1997, Hansen-Hagge et al., 1998,
Jensen et al., 1998, Fujiwara et al., 1998, Wilkinson and Hochstrasser 1998).
The [...] most highly [...] and the His box (Fig.4), known to be essential for
the main function of these
enzymes: the cleavage of ubiquitin at its carboxy terminus from extension
proteins (ubiquitin precursors) and ubiquitinated proteins and protein
fragments targeted for the degradation by the 26S proteasome pathway
(Wilkinson and Hochstrasser 1998). The Cysteine residue at position 178
and the Histidine residues at positions 599 and 607 (marked with an asterisk
in Fig.4), which were shown to be an absolute requirement for the function of
USP-s and UCH-s (Amerik et al., 1997, Hansen-Hagge et al., 1998,
Wilkinson and Hochstrasser 1998) were found in the correct positions in the
sequence of the new gene. Since this is the first member of the USP family
known to map onto human chromosome 21, we named this protein USP25,
for Ubiquitin Specific Protease on Chromosome 21.

The novel protein (USP25) cleaves ubiquitin from carboxy-terminal
fusion proteins

The ability of USP25 to cleave a model ubiquitin fusion protein
substrate was investigated by co-expression in E. coli. The complete coding
sequence of USP25 was cloned into a T5-driven, IPTG inducible expression
vector (pQE30). The new gene USP25 was cloned from nucleotide position
203 to nucleotide position 3367 (numbering as in sequence ID no. 2) into
SacI/SalI cloning sites of the IPTG-inducible E. coli expression vector. As a
positive control, the plasmid pRB105 containing a UBP2 gene encoding a
S. cerevisiae ubiquitin specific protease in an IPTG inducible and Amp^r
vector was used. The XL-1 blue strain of E. coli was co-transformed with
the plasmid containing a ubiquitin-Met-β-galactosidase model fusion protein
in an IPTG-inducible and chloramphenicol resistant vector, in addition to
either pQE30 vector, pQE30-USP25 or the positive control (pRB105). Each
of the 3 co-transformants was selected on medium containing
chloramphenicol (42 µg/ml) and carbenicillin (75 µg/ml). Co-transformants
were grown to exponential phase, IPTG induced, and the crude protein
extracts from these cultures were analysed by Western blot using an
anti-β-galactosidase antibody. The Western blots were prepared by
electrotransfer to a nitrocellulose membrane (Schleicher and Schuell). The
β-galactosidase-containing bands were detected by an anti-β-galactosidase
polyclonal rabbit antiserum using an enhanced chemiluminescence assay kit
(ECL, Amersham RPN2132) under conditions recommended by the
manufacturer.

As can be seen in Fig.4, the uncleaved Ub-Met-β-gal substrate (band
labelled with an asterisk in Fig.4, lane 4) converts to an 8 kDa shorter band
(triangle in Fig.4) in the cells co-transformed with either USP25 (lanes 5, 6) or
the yeast UBP2 expressing plasmid (lanes 9, 10). Constitutive expression of
USP25 (lane 5) is quite sufficient to cleave to completeness the low levels of
model substrate. The more prominent and highly induced band migrating
slightly further in the gel than the de-ubiquitinated cleavage product is the
truncated form of β-galactosidase expressed by the XL-1 blue bacteria
(compare to lanes 1, 2 in Fig.4). This result demonstrates that the novel gene
product named USP25 can efficiently function as a de-ubiquitinating
enzyme.
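
The 8 kDa shift is what would be expected for removal of a single ubiquitin moiety of 76 amino acids. A back-of-envelope sketch follows, assuming the common rule of thumb of roughly 110 Da per amino acid residue; that figure is an approximation introduced here and is not taken from the experiments, and the function name is hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue in a protein
# (an assumption for estimation only).
AVERAGE_RESIDUE_MASS_DA = 110.0


def approx_mass_kda(n_residues):
    """Rough protein mass in kDa from the number of residues."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


# Ubiquitin is a 76 amino acid polypeptide.
print(approx_mass_kda(76))
```

The estimate of about 8.4 kDa agrees with the observed band shift within the resolution of a Western blot.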

From the homologies in the functional domains and from its ability to
hydrolyse the bond between the C-terminal double glycine of ubiquitin and
the linking methionine residue (Fig.5), it can be concluded that USP25 is a
member of ubiquitin specific proteases.

Determination of Proteins with which USP25 interacts
Functional analysis of USP25 was performed with the aim of detecting
the cellular proteins which interact with the USP25 protein through protein-
protein interaction, using the Yeast-Two-Hybrid (Y2H) approach.
Saccharomyces cerevisiae yeast has well characterised ubiquitin activating,
conjugating and ligating enzymatic machinery, capable of ubiquitinating
human proteins (Scheffner et al. 1998). A cDNA library from human brain
cloned in a "prey" vector was co-transfected to yeast cells with USP25 cloned
in a "bait" vector. Since the ubiquitin cleaving activity of USP25 was proven
(Groet et al. 2000), this technique has a theoretical chance of detecting the
natural cellular substrates for ubiquitin cleavage and de-ubiquitination by
USP25. However, since the action of ubiquitin cleavage is very rapid
(Wilkinson and Hochstrasser 1998), the cleavage and dissociation from its
natural substrates for fully active USP25 could preclude the ability to detect
the interaction through Y2H. In addition, the artificial cross-de-ubiquitination
of yeast's own proteins by an overexpressed USP25 could theoretically be
harmful for the yeast cell and/or for the molecular interactions required for
the Y2H. For these reasons we performed Y2H using USP25-C178A, a site
directed mutant we recently engineered (the mutation being of the key Cys
residue in the Cys region) which abolishes the capacity for cleavage of
ubiquitin by USP25, but should not interfere with the binding of USP25 to its
natural ubiquitinated substrates, since this residue is conserved between all
UCH-s and USP-s so far identified.
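
As a purely illustrative sketch (not part of the experimental work described here), the C178A substitution can be located on the sequence listing. The snippet assumes the first 202 residues of SEQ ID no 5 and the Cys domain of SEQ ID no 2 as printed in this document; the names `USP25_NTERM`, `CYS_DOMAIN` and `substitute` are hypothetical helpers introduced only for this example.

```python
# First 202 residues of USP25, copied from the start of SEQ ID no 5 (assumed correct as printed).
USP25_NTERM = (
    "MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELA"
    "VAFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKD"
    "DLQRAIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPTEVW"
    "RDSRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKP"
)

# Cys domain as given in SEQ ID no 2.
CYS_DOMAIN = "GLKNVGNTCWFSAVIQSL"


def substitute(seq: str, position: int, old: str, new: str) -> str:
    """Replace the residue at a 1-based position, checking the wild-type residue first."""
    if seq[position - 1] != old:
        raise ValueError(
            f"expected {old} at position {position}, found {seq[position - 1]}"
        )
    return seq[: position - 1] + new + seq[position:]


# 1-based start of the Cys domain and position of its single cysteine.
domain_start = USP25_NTERM.index(CYS_DOMAIN) + 1
cys_position = domain_start + CYS_DOMAIN.index("C")

# Hypothetical in-silico equivalent of the C178A site-directed mutant.
mutant = substitute(USP25_NTERM, cys_position, "C", "A")

print(cys_position)  # 178
```

Under these assumptions the only cysteine of the Cys domain falls at residue 178, which is consistent with the mutant name USP25-C178A.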

The Y2H experiment used USP25-C178A cloned in the yeast two hybrid
"bait" vector pAS2 (Clontech), co-transformed into yeast cells together with a
human adult brain cDNA library in the "prey" vector pACT2 (Clontech).
Interacting events were visualized by the activation of transcription of all
three reporter genes: Ade2, Mel1 and His3. Interacting "prey" sequences
were verified by PCR-sequencing on the ABI310 automated sequencer,
using universal vector primers, and analysed by BLAST search on non-
redundant genome and transcriptome sequence databases. The accession
numbers of the sequences found to be interacting, from the GenBank
database, are given in the table.

Table 1. Summary of frequency and identities of specific interacting
proteins from human brain with USP25-C178A, detected using the Yeast-
Two-Hybrid technique

Summary of results in decreasing     Number of specific independent   Accession number
frequency of detection of baits      clones "fished" by Y2H

HHR23A                               8 clones                         D21235
SUMO-3                               1 8 clones                       NM_006937
human UBC9                           5 clones                         U66867
polyUbiquitin                        4 clones                         AB009010
Ubiquitin                            3 clones                         X04803
RanBP2 protein                       1 clone                          NM_006267
Various ubiquitin-like precursors    4 clones
(1 or 2 clones each)
Other proteins (1 or 2 clones each)  10 clones


Conclusions - Interaction with HHR23A

DNA repair plays a key role in prevention of carcinogenesis and
mutagenesis. This is potentially of special significance to solid tumours
which have exposure to UV and chemical carcinogens as the major risk
factor, such as cancers of the skin and lung. HHR23A is a homologue of the
yeast RAD23 protein (Masutani et al. 1994), involved in DNA excision-repair
after UV damage and implicated in spindle pole body duplication and cell
cycle progression in yeast (Watkins et al. 1993, Biggins et al. 1996). The human
homologues HHR23A and B (Masutani et al. 1994) both belong to a group of
proteins which, when mutated, lead to Xeroderma Pigmentosum, a rare
autosomal recessive disorder associated with a high incidence of sunlight
(UV) induced skin cancers.

More importantly, hRAD23A has also been isolated as a primary
interacting protein by the same Y2H technique using E6AP as a "bait"
(Kumar et al. 1999). E6AP (Human Papilloma Virus E6 associated protein)
functions as one of the two so far detected ubiquitin ligases (attaching
ubiquitin and labelling for degradation) for the master tumour suppressor p53
(Scheffner et al. 1993). p53 and HHR23A are the only two so far proven
targets for this ubiquitin ligase (Kumar et al. 1999). Since USP25 shows a
high rate of target preference for HHR23A (see Table 1), and both HHR23A
and p53 are ubiquitinated by E6AP, it could mean that they are both de-
ubiquitinated by USP25. The fact that Y2H with USP25 did not pick up p53 is
understandable, because p53 is expressed at very low levels in
normal tissues, and only accumulates and becomes activated following DNA
damage or other stimuli for programmed cell death (apoptosis) (Haupt et al.
1997, Kubbutat et al. 1997, Lane 1998). Further experiments are therefore
justified to provoke the p53 response, and monitor the effects of USP25 on
p53 levels.

Moreover, lack of functional E6AP accelerates the polyglutamine-
induced neuronal cell death in the mouse model for the neurodegenerative
disease Spinocerebellar ataxia 1 (SCA1) (Cummings et al. 1999). Lack of the
E6AP gene in a mouse expressing the polyglutamine stretch mutation of the
SCA1 protein dramatically reduces the presence of ubiquitinated
intranuclear neuronal inclusions, but drastically accelerates the neuronal
degeneration and cell death (Cummings et al. 1999). A very similar effect
has been observed in Huntington's Disease (HD), where a dominant
negative mutant of a ubiquitin conjugating enzyme (UBC3), when co-
expressed in cultured neurons with the huntingtin protein bearing the
polyglutamine extension (the mutation causing HD), drastically reduces the
presence of ubiquitinated intranuclear neuronal inclusions, but drastically
accelerates the neuronal degeneration and cell death (Saudou et al. 1998).
If USP25 de-ubiquitinates a similar set of target proteins to the ones
ubiquitinated by E6AP, then the overexpression of USP25 may lead to
similar effects as the inhibition of ubiquitin conjugation by E6AP. It is
therefore justified to examine the effects of overexpression of USP25 (and
other USP-s) on neuronal toxicity.

Interaction with human UBC9 and ubiquitin-like proteins (UbL)
In yeast, there are 13 ubiquitin conjugating enzymes (E2 enzymes)
(Scheffner et al. 1998). Only two (UBC3, mentioned previously in context
with HD, and UBC9) are essential for cell cycle progression. Without UBC3
the cell cycle is arrested at the transition point from G1 to S phase, whereas
without UBC9 the cell cycle is arrested at the transition point from G2 to M
phase (Scheffner et al. 1998). Yeast UBC9 and its mammalian homologues,
e.g. human UBC9, have a special function among all other UBC (E2)
enzymes, in that they specifically conjugate to target proteins not the
molecule of ubiquitin, but rather Ubiquitin-Like small proteins (UbL)
(Gong et al. 1997, Schwarz et al. 1998). The yeast Smt3 and human SUMO-1
(PIC1, Sentrin, hSmt3C), SUMO-2 (hSmt3A) and SUMO-3 (hSmt3B)
belong to the same family of UbL proteins with approximately 50% identity
between themselves, and some 15-30% identity and 40-60% similarity in
amino acid sequence to ubiquitin (Lapenta et al. 1997, Mannen et al. 1996,
Kamitani et al. 1998, Saitoh and Hinchey 2000). Yeast and human UBC9
are capable of conjugating equally yeast or human UbL-s, but not ubiquitin
(Schwarz et al. 1998). SUMO-1, -2 and -3 have the C-terminal glycine
necessary for ubiquitination of the target protein's lysine residue but, unlike
ubiquitin, do not have the Lys48 residue necessary for the formation of
polyubiquitin chains through isopeptide bonds, which are the signal for
proteasome degradation (Saitoh and Hinchey 2000). Nevertheless, yeast
Smt3 protein can rescue the mutant Mif2 phenotype, a deficient centromere
binding protein resulting in chromosome missegregation (Meluh and
Koshland 1995). SUMO-1, as well as SUMO-3 (and probably also SUMO-2),
are all capable of being attached by UBC9 to RanGAP1, a Ran GTP-ase
activating protein (Kamitani et al. 1998). This ATP-dependent attachment is
essential for the binding between modified RanGAP1 and the RanBP2 binding
protein, in order to form a functional nuclear pore complex, which controls
export and import of molecules through the nuclear envelope (Mahajan et al.
1997, Matunis et al. 1998, Lee et al. 1998). In addition, UbL small proteins
have been shown to modify the death domains of Fas (Okura et al. 1996),
Tumour necrosis factor receptor 1 (Okura et al. 1996), PML (a tumour
suppressor implicated in the pathogenesis of acute promyelocytic
leukaemia) (Kamitani et al. 1998b) and Rad51/52 DNA repair proteins (Shen
et al. 1996a). Their conjugating enzyme, UBC9, has been shown to interact
by Y2H technique with RAD51/52 DNA repair proteins, and the master
tumour suppressor p53 (Shen et al. 1996b).
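
The two sequence features discussed above (a C-terminal glycine used for conjugation, and a Lys48 used for chain formation) can be checked on a sequence with a few lines of code. The following is a minimal sketch; the ubiquitin sequence is the standard 76-residue human ubiquitin, supplied here from general knowledge as an assumption rather than taken from this document, and the helper names are hypothetical. No SUMO sequences are included, since none are listed in this document.

```python
# Standard 76-residue human ubiquitin (assumption: not listed in this document).
UBIQUITIN = (
    "MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQ"
    "QRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGG"
)


def lysine_positions(seq: str) -> list[int]:
    """Return the 1-based positions of all lysine (K) residues."""
    return [i + 1 for i, residue in enumerate(seq) if residue == "K"]


def has_residue(seq: str, position: int, residue: str) -> bool:
    """True if the 1-based position exists and carries the given residue."""
    return 0 < position <= len(seq) and seq[position - 1] == residue


def ends_with_gly_gly(seq: str) -> bool:
    """True if the sequence ends in the C-terminal double glycine."""
    return seq.endswith("GG")


print(lysine_positions(UBIQUITIN))      # [6, 11, 27, 29, 33, 48, 63]
print(has_residue(UBIQUITIN, 48, "K"))  # True
print(ends_with_gly_gly(UBIQUITIN))     # True
```

The same three helpers could be applied to any mature UbL sequence to test for the C-terminal glycine and for a lysine at position 48.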

The USP25 Y2H data show a clear pattern of interaction in the UBC9
pathway. Interaction with UBC9 itself is, to our knowledge, the first
demonstration of its kind of a direct protein-protein interaction between a
USP and a conjugating enzyme. Interaction with RanBP2 and SUMO-3 clearly
shows that USP25 could be sharing a similar target repertoire with UBC9
(Saitoh et al. 1997). USP25 may be removing the SUMO (UbL) molecules
attached to targets by UBC9. Alternatively, USP25 may be preparing the
UbL-s for attachment by UBC9 to targets, by removing the oligopeptide
extensions after the C-terminal Gly-Gly group from the UbL-s. The only ubiquitin
protease found to be essential for yeast cell cycle progression, Ulp1, was
found to be specific to Smt3 removal, and not to ubiquitin (Li and
Hochstrasser 1999). This protease also had a completely different sequence
from known USP-s and UCH-s. Its human homologues, SENP1 and SUSP-1,
were recently cloned (Kim et al. 2000, Gong et al. 2000a), and found to
specifically cleave SUMO-1, -2 and -3, but not ubiquitin and NEDD8, another
UbL (Kamitani et al. 1997). The first human enzyme with classical USP
structure (Cys, His domains) for which dual specificity to both ubiquitin and
a ubiquitin-like protein was demonstrated was the very recently published
USP21 on chromosome 1q21 (Gong et al. 2000b). However, in contrast to SENP1
and SUSP-1, this enzyme cleaves ubiquitin and NEDD8, but not SUMO-1, -2
or -3 (Gong et al. 2000b).

If USP25 cleaves the same UbL proteins to which it binds by Y2H
(see Table 1), it would appear that it is the first classical USP capable of
cleaving both ubiquitin and the human SUMO family, which may be very
significant to its function. Moreover, it is possible that USP25 is actually
specific for SUMO-3 rather than SUMO-1 or SUMO-2. The distinction
between these targets would be the first of its kind, but remains to be
confirmed. The previously observed interaction with hRAD23A (Table 1), as
well as with various other precursor proteins containing ubiquitin-like
domains (Table 1), could also be linked to its affinity to a certain type of
ubiquitin-like domain.

Finally, and very importantly, SUMO molecules have been shown to
attach post-translationally to the p53 tumour suppressor itself, by the
conjugating enzyme UBC9 (Rodriguez et al. 1999, Gostissa et al. 1999).
This modification for SUMO-1 in vitro requires only SUMO-1, the SUMO-1
activating enzyme and UBC9. Rodriguez et al. therefore
conclude that the SUMO-1 modification pathway acts as a potential regulator of
the p53 response and may represent a novel target for the development of
therapeutically useful modulators of the p53 response (Rodriguez et al.
1999).

We would propose that the cleavage of ubiquitin-like proteins,
and its association with UBC9, could be an alternative pathway explaining
USP25's role in cell cycle control and, through it, its role in control of
programmed cell death, with direct implications in both tumour suppression
and neurodegeneration.



Figure Legends

Figure 1. Identification of the Shared Region of Overlap
(S.R.O.) for hemizygous deletions in 21q11-q21 in NSCLC. A: Cytogenetic
map, Not I long range physical map (Ichikawa et al. 1993), YAC contig
(Nizetic et al. 1994, Shimizu et al. 1995 and Bosch et al. 1996), and bacterial
contig (Groet et al. 1998) are shown in consecutive horizontal layers,
respectively, above the line showing the markers used in the LOH analysis
(oval symbols). Markers are named as in Genome DataBase (prefix "D21"
omitted). In the column under each marker an "X" (symbolizing LOH), "+"
(an absence of LOH) and "U" (un-informative, homozygous result) for that
marker in the set of eleven Croatian Tumour Bank (CTB) tumours, or in
individual tumours #47 and #61, are shown. NT=not tested. For comparison
with our data, markers used as probes on the genomic Southern blot of the
NSCLC cell line, and/or in LOH analysis of fresh tumours in the study by
Kohno and co-workers (Kohno et al. 1998) are indicated above the empty bar
symbolizing the homozygous deletion they found. In our data, hatched bars
indicate hemizygous deletions, and black filled bars indicate segments
showing absence of LOH or deletions. Squared symbols "X" and "+" stand
for predominantly single and predominantly double signal, respectively,
detected by FISH on interphase nuclei of the paraffin embedded sections of
the tumour #61, when PAC clones, named and indicated as bold lines in the
PAC contig above the markers line, were used as probes.

Figure 2. Trapped exons (hatched rectangles) and exons
deduced from overlapping sequence analysis (white rectangles) defining the
exon-intron structure of the new gene USP25. Top half shows two PACs,
73M5 and 135E14, also used as FISH probes in Fig. 1, which were the
source of genomic DNA for exon trapping. Exon locations on the PACs are
shown with vertical bars, and the 50 kbp scale bar refers to this part. Bottom
half consists of overlapping cDNA fragments corresponding to exons above
them, drawn in the same scale (500 bp scale bar is shown). Names of
cDNA clones are as in dbEST and UniGene databases; 824710 is the
address of the clone in the IMAGE Consortium collection. The complete
cDNA sequence for the whole gene is the new GenBank entry with the
accession number AF134213.

Figure 3. Comparison of protein sequences of USP25 to
other eukaryotic members of the superfamily of USP-s. The protein BAP-1 is
actually from the family of Ubiquitin C-terminal Hydrolases, a distinct sub-
family of this superfamily, showing homology only in the single key
amino acids in the Cys and His domains. Two reports show the localisations
of the highly homologous sequences for the HAUSP gene to 3p21 (Kashuba
et al. 1997) and 16p13.3 (Robinson et al. 1998), respectively.

Figure 4. Demonstration of the de-ubiquitinating activity of
USP25 on a model ubiquitin fusion protein. Western blot of an SDS-PAGE
gel. Lanes 1,2: the E. coli XL-1 blue cells alone (in all cases the second
lane of the pair is +IPTG). Lanes 3,4: same cells co-transfected with the
model fusion protein encoding plasmid pACYC-Ub-Met-β-galactosidase
(protein band labelled with an asterisk). Lanes 5,6: as lanes 3,4 except
pQE30-USP25 (full length USP25 gene cloned in the pQE30 expression
vector) was added instead of pQE30. Lanes 7,8: same as lanes 3,4 except
pRB105 (yeast de-ubiquitinating enzyme UBP2) was transfected instead of
pQE30. Lanes 9,10: over-exposure of lanes 7,8. Note the presence of the
8 kDa shorter de-ubiquitinated Met-β-galactosidase (band labelled with a
triangle).

References

Amerik, A.Yu., Swaminathan, S., Krantz, B.A., Wilkinson, K.D., and
Hochstrasser, M. 1997. In vivo disassembly of free polyubiquitin chains by
yeast Ubp14 modulates rates of protein degradation by the proteasome.
EMBO J. 16:4826-4838.

Baker R.T., Tobias J.W., Varshavsky A. 1992. Ubiquitin-specific
proteases of Saccharomyces cerevisiae. J. Biol. Chem. 267:23364-23375.

Baker, R.T., Wang, X.W., Woollatt, E., White, J.A., and Sutherland,
G.R. Identification, functional characterisation, and chromosomal
localisation of USP15, a novel human ubiquitin-specific protease related to
the UNP oncoprotein, and a systematic nomenclature for hUSP's.
Genomics 59, 264-274 (1999).

Biggins S. et al. J. Cell Biol. 133:1331-1346 (1996)

Bosch, A., Guimera, J., Gaw, S., Gardiner, K., Chumakov, I.,
Patterson, D. and Estivill, X. Integration of 30 CA-repeat markers into the
cytogenetic, genetic and YAC maps of human chromosome 21. Eur. J. Hum.
Genet. 4, 135-142, 1996.



WO 00/79267 PCT/GB00/02446

Bosnar MH, Pavelic K, Krizanac S, Slobodnjak Z, Pavelic J. 1997.
Squamous cell lung carcinomas: the role of nm23-H1 gene. J. Mol. Med.
75:609-613.

Brodsky, G., Otterson, G.A., Parry, B.B., Hart, I., Patterson, D., and
Kaye, F.J. Localization of STCH to Human Chromosome 21q11.1.
Genomics, 30:627-628, 1995.

Cavailles, V., Dauvois, S., L'Horset, F., Lopez, G., Hoare, S.,
Kushner, P.J., and Parker, M.G. Nuclear factor RIP140 modulates
transcriptional activation by the estrogen receptor. EMBO J., 14:3741-3751,
1995.

Cummings C.J. et al. Neuron 24:879-892 (1999)

d'Andrea, A. and Pellman, D. Deubiquitinating enzymes: A new class of
biological regulators. Genomics 62, 395-405, 1999.

Everett, R.D., Meredith, M., Orr, A., Cross, A., Kathoria, M., and
Parkinson, J. A novel ubiquitin-specific protease is dynamically associated
with the PML nuclear domain and binds to a herpesvirus regulatory protein.
EMBO J., 16:1519-1530, 1997.

Fero, M.L., Randel, E., Gurley, K.E., Roberts, J.M., and Kemp, C.J.
The murine gene p27 is haplo-insufficient for tumour suppression.
Nature, 396:177-180, 1998.

Fujiwara, T., Saito, A., Suzuki, M., Shinomiya, H., Suzuki, T.,
Takahashi, E., Tanigami, A., Ichiyama, A., Chung, C.H., Nakamura, Y., and
Tanaka, K. Identification and chromosomal assignment of USP1, a novel
gene encoding a human ubiquitin-specific protease. Genomics, 54:155-158,
1998.

Gardiner K. 1996. Base composition and gene distribution: critical
patterns in mammalian genome organisation. Trends Genet. 12:519-524.

Gong L. et al. J. Biol. Chem. 272(45):28198-28201 (1997)

Gong L. et al. J. Biol. Chem. 275(5):3355-3359 (2000a)

Gong L. et al. J. Biol. Chem. 275(12):14212-14216 (2000b)




Gostissa M, Hengstermann A, Fogal V, Sandy P, Schwarz SE,
Scheffner M, Del Sal G (1999) Activation of p53 by conjugation to the
ubiquitin-like protein SUMO-1. EMBO J 1999 Nov 15;18(22):6462-71.

Groet, J., Ives, J.H., South, A.P., Baptista, P.B., Jones, T.A., Yaspo,
M-L., Lehrach, H., Potier, M-C., Van Broeckhoven, C., Nizetic, D. Bacterial
contig map of the 21q11 region associated with Alzheimer's disease and
abnormal myelopoiesis in Down syndrome. Genome Res., 8:385-398, 1998.

Groet, J., Ives, J.H., Jones, T.A., Danton, M., Flomen, R.H., Sheer,
D., Hrascan, R., Pavelic, K., and Nizetic, D. Narrowing of the Region of
Allelic Loss in 21q11-21 in squamous Non-small cell Lung Carcinoma and
Cloning of a Novel Ubiquitin-specific Protease Gene from the Deleted
Segment. Genes Chromosomes Cancer 27:153-161 (2000).

Hansen-Hagge TE, Janssen JWG, Hameister H, Papa FR, Zechner
U, Seriu T, Jauch A, Becks D, Hochstrasser M, Bartram CR. 1998. An
evolutionarily conserved gene on human chromosome 5q33-q34, UBH1,
encodes a novel deubiquitinating enzyme. Genomics 49:411-418.

Haupt Y, Maya R, Kazaz A, Oren M (1997) Mdm2 promotes the rapid
degradation of p53. Nature 387:296-299.

Ichikawa, H., Hosoda, F., Arai, Y., Shimizu, K., Ohira, M., and Ohki,
M. A Not I restriction map of the entire long arm of human chromosome 21.
Nat. Genet., 4:361-366, 1993.

Jensen DE, Proctor M, Marquis ST, Gardner HP, Ha SI, Chodosh LA,
Ishov AM, Tommerup N, Vissing H, Sekido Y, Minna J, Borodovsky A,
Schultz DC, Wilkinson KD, Maul GG, Barlev N, Berger SL, Prendergast GC,
Rauscher FJ III. 1998. BAP1: a novel ubiquitin hydrolase which binds to the
BRCA1 RING finger and enhances BRCA1-mediated cell growth
suppression. Oncogene 16:1097-1112.

Kamitani T. et al. J. Biol. Chem. 272(45):28557-28562 (1997)

Kamitani T. et al. J. Biol. Chem. 273(18):11349-11353 (1998a)

Kamitani T. et al. J. Biol. Chem. 273(6):3117-3120 (1998b)




Kashuba, V.I., Grizatullin, R.Z., Protopopov AI, Allikmets R, Korolev
S, Li J, Boldog R, Tory K, Zabarovska V, Marcsek Z, Sumegi J, Klein G,
Zabarovsky ER, Kisselev L. 1997. NotI linking/jumping clones of human
chromosome 3: mapping of the TFRC, RAB7 and HAUSP genes to regions
rearranged in leukemia and deleted in solid tumours. FEBS Lett. 419:181-
185.

Kim I.K. et al. J. Biol. Chem. 275(19):14102-14106 (2000)

Kohno, T., Kawanishi, M., Matsuda, S., Ichikawa, H., Takada, M.,
Ohki, M., Yamamoto, T., and Yokota, J. Homozygous deletion and frequent
allelic loss of the 21q11.1-q21.1 region including the ANA gene in human
lung carcinoma. Genes Chromosom. Cancer, 21:235-243, 1998.

Kubbutat M.H.G. et al. Nature 387:299-303 (1997)

Kumar S. et al. J. Biol. Chem. 274(26):18785-18792 (1999)

Lane D. 1998. Awakening angels. Nature 394:616-617.

Lapenta V. et al. Genomics 40:362-366 (1997)

Lee G.W. et al. J. Biol. Chem. 273(11):6503-6507 (1998)

Lennon GG, Auffray C, Polymeropoulos M, Soares MB. 1996. The
I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and
their expression. Genomics 33:151-152.

Li S.J. and Hochstrasser M. Nature 398:246-251 (1999)

Mahajan R. et al. Cell 88:97-107 (1997)

Mannen H. et al. Biochem. Biophys. Res. Comm. 222:178-180 (1996)

Masutani C. et al. EMBO J. 13(8):1831-1843 (1994)

Matunis M.J. et al. J. Cell Biol. 140(3):499-509 (1998)

Meluh P.B. and Koshland D. Mol. Biol. Cell 6:793-807 (1995)

Nizetic, D., Gellen L., Hamvas R., Mott R., Grigoriev A., Vatcheva
R., Zehetner G., Yaspo M.L., Dutriaux A., Lopes C., Delabar J.M., Van
Broeckhoven C., Potier M.C. & Lehrach H. An integrated YAC overlap and
"cosmid-pocket" map of the human chromosome 21. Human Molecular
Genetics 3(5):759-770 (1994).

Okura T. et al. J. Immunol. 157:4277-4281 (1996)




Oster, J., Mikkelsen, A. Mortality and life-table in Down syndrome.
Acta Paediatr. Scand., 64:322-326, 1975.

Robinson PA, Lomonte P, Leek P, Markham AF, Everett RD. 1998.
Assignment of herpesvirus-associated ubiquitin-specific protease gene
HAUSP to human chromosome band 16p13.3 by in situ hybridisation.
Cytogenet. Cell Genet. 83:100.

Rodriguez MS, Desterro JM, Lain S, Midgley CA, Lane DP, Hay RT
(1999) SUMO-1 modification activates the transcriptional response of p53.
EMBO J 1999 Nov 15;18(22):6455-61.

Saitoh H. et al. Proc. Natl. Acad. Sci. USA 94:3736-3741 (1997)

Saitoh H. and Hinchey J. J. Biol. Chem. 275(9):6252-6258 (2000)

Sambrook, J., Fritsch, E.F., & Maniatis, T. Molecular Cloning: a
laboratory manual, 2nd ed. Cold Spring Harbor, N.Y. 1989.

Satge, D., Sasco, A.J., Geneix, A., and Malet, P. Another reason to
look for tumor suppressor genes on chromosome 21. Genes Chromosomes
Cancer 21:1, 1998.

Saudou F. et al. Cell 95:55-66 (1998)

Scheffner M. et al. Cell 75:495-505 (1993)

Scheffner M. et al. The Ubiquitin-Conjugation System. In: J.M. Peters,
D. Finley and R. Harris (eds.), Ubiquitin and the Biology of the Cell. Plenum
Press, New York, (1998).

Schwarz S.E. et al. Proc. Natl. Acad. Sci. USA 95:560-564 (1998)

Shen Z. et al. Genomics 36:271-279 (1996a)

Shen Z. et al. Genomics 36:183-186 (1996b)

Shimizu, N., Antonarakis, S., Van Broeckhoven, C., Patterson, D.,
Gardiner, K., Nizetic, D., Creau, N., Delabar, J., Korenberg, J., Reeves, R.,
Doering, J., Ritter, O., and Cuticchia, J. Report of the fifth international
workshop on human chromosome 21 mapping 1994. Cytogenet. Cell
Genet., 70:147-182, 1995.

Spaventi R., Pecur, L., Pavelic, K., Pavelic, Z.P., Spaventi, S. and
Sambrook, P.J. Human Tumour Bank in Croatia: a possible model for a
small bank as part of the future European Tumour Bank Network. Eur. J.
Cancer 30A, 419, 1994.

Swanson DA, Freund CL, Ploder L, McInnes RR, Valle D. 1996. A
ubiquitin C-terminal hydrolase gene on the proximal short arm of the X
chromosome: implications for X-linked retinal disorders. Hum. Mol. Genet.
5:533-538.

The C. elegans Sequencing Consortium, 1998. Genome sequence of
the nematode C. elegans: a platform for investigating biology. Science
282:2012-2018.

Valero, R., Marfany, G., Gonzalez-Angulo, O., Gonzalez-Gonzalez,
G., Puelles, L. and Gonzalez-Duarte, R. USP25, a Novel Gene Encoding a
Deubiquitinating Enzyme, is located in the Gene-Poor Region 21q11.2.
Genomics 62:395-405 (1999).

Watkins J.F. et al. Mol. Cell Biol. 13:7757-7765 (1993)

Wilkinson, K.D. (1997). Regulation of ubiquitin-dependent processes
by deubiquitinating enzymes. FASEB J. 11:1245-1256.

Wilkinson, K.D., and Hochstrasser, M. Deubiquitinating enzymes. In:
J.M. Peters, D. Finley and R. Harris (eds.), Ubiquitin and the Biology of the
Cell. Plenum Press, New York, 1998.

Wolozin, B. et al. Regulation of Apoptosis by Presenilin 1.
Neurobiology of Aging, Vol. 19, (15) p 523-527, 1998.

Yaspo M-L, Gellen L, Mott R, Korn B, Nizetic D, Poustka A, Lehrach
H. 1995. Model for a transcript map of human chromosome 21: isolation of
new coding sequences from exon and enriched cDNA libraries. Hum. Mol.
Genet. 4:1291-1304.




CLAIMS

1. Use of the product of the USP25 gene located at human
chromosome 21q11-21 or a non-human homologue thereof or a functional
fragment of the product in the manufacture of a medicament for use in the
diagnosis, prophylaxis or treatment of cancer.

2. Use according to claim 1 in which the cancer is a solid
tumour, for instance non-small cell lung carcinoma or skin cancer.

3. Use according to claim 1 or claim 2 in which the product is
obtained from a microorganism or cell line in which the gene as recombinant
DNA has been introduced in a form in which it is replicated upon cell
division, and is transcribed and translated.

4. Use according to claim 3 in which the recombinant DNA is
incorporated in a vector which comprises at least a portion of sequence I.D. 6
comprising residues 199 to 3367, optionally including an additional 96 bp
exon inserted after base 2356, and optionally including 5' and/or 3'
untranslated regions (UTR's).

5. Use of a protein product having Cys, QQD and His domains
specified in sequence IDs numbers 2, 3 and 4, respectively, in the
manufacture of a medicament for use in the diagnosis, prophylaxis or
treatment of cancer.

6. Use according to claim 5 in which the protein has sequence
I.D. No. 1.

7. Use according to claim 5 in which the protein has sequence
I.D. No. 5.

8. An in vitro method in which mammalian cells are cultured in
the presence of a protein product having Cys, QQD and His domains
specified in sequence IDs numbers 2, 3 and 4, respectively, and the effect of
the protein on cell growth, cell growth arrest and/or apoptosis is assessed.

9. A method according to claim 8 in which the protein has
sequence I.D. 1 or sequence I.D. 5.

10. Use of DNA including the gene located at human
chromosome 21q11-21 (USP25) or a fragment thereof encoding a functional
USP or UbLSP product in the manufacture of a medicament for use in the
prophylaxis or treatment of cancer.

11. Pharmaceutical composition comprising a gene therapy
vehicle and DNA including the gene located at human chromosome 21q11-
21 (USP25) or a fragment thereof encoding a functional USP or UbLSP
product in transcribable form.

SEQ ID no 1: (polypeptide)

MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELAV

AFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKDDL 

QRAIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPTEVWRD 

SRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKPPS 

NAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDAFK 

SNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFYGRF 

LAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSEN 

SGKSGQEHWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLYLDR 

YMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYGSGPKRFPLVDVLQYA 

LEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPS 

SVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTEIE 

NDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAY 

IFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLMYINDKAQFL 

IQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQ 

EKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKEETIQIITKASH 

EHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVVVYFIQNQAPK

KIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQD 

YRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELI 

SHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEK 

DILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKL 
PSYSTHELCERFARIMLSLSRTPADGR

SEQ ID no 2: (polypeptide)
GLKNVGNTCWFSAVIQSL

SEQ ID no 3: (polypeptide)
QQDVSEFTHKLLDWLED

SEQ ID no 4: (polypeptide)
YRLHAVLVHEGQANAGHYWAY

SEQ ID no 5: (polypeptide)

MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELA 
VAFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKD 
DLQRAIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPTEVW 
RDSRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKP 

PSNAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDA
FKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 
GRFLAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLH 
SENSGKSGQEHWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLY 
LDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYGSGPKRFPLVDV 

LQYALEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELP
STSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEKLSVLESCLHR 
WRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANA 
GHYWAYIFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLM 
YINDKAQFLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEW 

DAQLAQKALQEKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKH
LKEETIQIITKASHEHEDKSPETVLQSIMMTPNMQGIIMAIGKSRSVYDRCG
PEAGFFKAIKLEYARLVKLAQEDTPPETDYRLHHVVVYFIQNQAPKKIIEKT
LLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRK 
FRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELIS 

HYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEE
KDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEP 
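
The correspondence between the nucleotide listing (SEQ ID no 6) and the polypeptide listings can be spot-checked by translating the start of the open reading frame with the standard genetic code. The sketch below is illustrative only: `translate` and `ORF_START` are hypothetical names, and `ORF_START` holds the first 18 codons read from the listing beginning at the initiating atg, with one OCR-damaged base ("glg") read as "gtg", which is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces are ignored, '*' marks a stop codon."""
    dna = dna.lower().replace(" ", "")
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 18 codons of the coding region of SEQ ID no 6 (assumed reading, see above).
ORF_START = "atgaccgtggagcagaacgtgctgcagcagagcgcggcgcagaagcaccagcag"

print(translate(ORF_START))  # MTVEQNVLQQSAAQKHQQ
```

Under these assumptions the translated peptide agrees with the first 18 residues of SEQ ID no 5.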
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SEQ ID no 6: 
dsDNA 

>BASE COUNT 1223 a 752 c 841 g 987 1 
>ORI6IN 

> 1 acagtcggcg tttcgccgcc tgcccgcggt gcccgcgcac gccggccgcc atcgccttcg 

> 61 cgcctggctg gcgggggcgc tgtcctccca ggccgtccgc gccgctccct ggagctcggc 

> 121 ggagcgcggc agccagggcc ggcggaggcg cgaggagccg ggcgccaccg ccgccgccgc 

> 181 cgccgccgcc gcgggggcca tgaccglgga gcagaacgtg ctgcagcaga gcgcggcgca 

> 241 gaagcaccag cagacgtttt tgaa.caact gagagaaatt acggggatta algacaccca 

> 301 gatadacag caagccttga aggataglaa tggaaacttg gaattagcag iggctttcct 

> 361 tactgcgaag aatgctaaga cccctcagca ggaggagaca acttadacc aaacagcact 

> 421 tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg igattgatct 

> 481 cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 

> 541 cagggcatfc agggagactg gaataadga tgaggaacaa gccattagca gagttcttga 

> 601 agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 

> 661 ticlcgaaac ccttatgata gaaaaagaca ggacaaagct cccgtlgggc taaagaatgt 

> 721 tggcaatact tgtlggttta gtgdgttat tcagtcatta tltaatcttl tggaatttag 

> 781 aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 

> 841 ggaacatcgg aatttgccit ttatgcgtga gctgaggtat cfatttgcac ttcttgttgg 

> 901 taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 

> 961 atcaaatgac tcacagcagc aagatglgag tgagtttaca cacaaattat lagattggtt 

> 1021 ^Saagatgccltccaaatgaaagctgaagaggagacggatgaagagaagc caaagaaccc 
1081 catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt

> 1141 tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca

> 1201 tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc
1261 aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt
1321 gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga
1381 atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat
1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga
1501 aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta
1561 tgcattggaa tttgcctcaa gtaaacctgt ttgcacttct cctgttgacg atattgacgc
1621 tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca
1681 gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc
1741 gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca
1801 tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg
1861 ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca
1921 tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca
1981 tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga
2041 tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 
2101 agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 
2161 cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 
2221 ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 
2281 acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 
2341 aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 

> 2401 agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 

> 2461 agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga

> 2521 aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 

> 2581 agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 

> 2641 ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt

> 2701 gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 

> 2761 aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 

> 2821 cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 

> 2881 agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg
> 2941 cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa

> 3001 attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 

> 3061 tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 

> 3121 ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 

> 3181 tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 
> 3241 ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca

> 3301 tgaactctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg
3361 aagataaact gcacactttc cctgaacaca ctgtataaac tctttttagt tcttaaccct
3421 tgccttcctg tcacagggtt tgcttgttgc tgctatagtt tttaactttt ttttatttta
3481 ataactgcaa aagacaaaat gactatacag actttagtca gactgcagac aataaagctg 

> 3541 aaaatcgcat ggcgctcaga cattttaacc ggaactgatg tataatcaca aatctaattg

> 3601 attttattat ggcaaaacta tgcttttgcc accttcctgt tgcagtatta ctttgctttt

> 3661 atcttttctt tctcaacagc tttccattca gtctggatcc ttccatgact acagccattt

> 3721 aagtgttcag cactgtgtac gatacataat atttggtagc ttgtaaatga aataaagaat

> 3781 aaagttttat ttatggctac cta

SEQ ID no 7 dsDNA

> ca tgaccgtgga gcagaacgtg ctgcagcaga gcgcggcgca

> 241 gaagcaccag cagacgtttt tgaatcaact gagagaaatt acggggatta atgacaccca
> 301 gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct

> 361 tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 

> 421 tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 

> 481 cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 

> 541 cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 

> 601 agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga

> 661 ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt

> 721 tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcttt tggaatttag

> 781 aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa

> 841 ggaacatcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg

> 901 taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 

> 961 atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 

> 1021 ^Saagatgccttccaaatgaaagctgaagaggagacggatgaagagaagccaaagaaccc 

> 1081 catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt

> 1141 tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca

> 1201 tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc

> 1261 aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt

> 1321 gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga
1381 atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat
1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga
1501 aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta
1561 tgcattggaa tttgcctcaa gtaaacctgt ttgcacttct cctgttgacg atattgacgc
1621 tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca
1681 gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc
1741 gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 
1801 tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg 
1861 ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 
1921 tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 
1981 tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga
2041 tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga
2101 agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta
2161 cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc
2221 ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca
2281 acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga
2341 aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc
2401 agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa
2461 agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga
2521 aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga
2581 agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa
> 2641 ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt

> 2701 gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 

> 2761 aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 

> 2821 cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 

> 2881 agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 

> 2941 cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 
3001 attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 
3061 tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 

> 3121 ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 
> 3181 tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga

> 3241 ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca

> 3301 tgaactctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg

> 3361 aagataa

SEQUENCE LISTING
<110> School Of Pharmacy
<120> Treatment of Cancer
<130> HMJ03303WO

<140> PCT/GB00/02446
<141> 2000-06-22

<150> 9914589.6
<151> 1999-06-22

<150> 0008161.2
<151> 2000-04-03

<160> 7

<170> PatentIn Ver. 2.1

<210> 1 
<211> 1055 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Met Thr Val Glu Gln Asn Val Leu Gln Gln Ser Ala Ala Gln Lys His
1 5 10 15

Gln Gln Thr Phe Leu Asn Gln Leu Arg Glu Ile Thr Gly Ile Asn Asp
20 25 30

Thr Gln Ile Leu Gln Gln Ala Leu Lys Asp Ser Asn Gly Asn Leu Glu
35 40 45

Leu Ala Val Ala Phe Leu Thr Ala Lys Asn Ala Lys Thr Pro Gln Gln
50 55 60

Glu Glu Thr Thr Tyr Tyr Gln Thr Ala Leu Pro Gly Asn Asp Arg Tyr
65 70 75 80

Ile Ser Val Gly Ser Gln Ala Asp Thr Asn Val Ile Asp Leu Thr Gly
85 90 95

Asp Asp Lys Asp Asp Leu Gln Arg Ala Ile Ala Leu Ser Leu Ala Glu
100 105 110

Ser Asn Arg Ala Phe Arg Glu Thr Gly Ile Thr Asp Glu Glu Gln Ala
115 120 125

Ile Ser Arg Val Leu Glu Ala Ser Ile Ala Glu Asn Lys Ala Cys Leu
130 135 140

Lys Arg Thr Pro Thr Glu Val Trp Arg Asp Ser Arg Asn Pro Tyr Asp
145 150 155 160

Arg Lys Arg Gln Asp Lys Ala Pro Val Gly Leu Lys Asn Val Gly Asn
165 170 175

Thr Cys Trp Phe Ser Ala Val Ile Gln Ser Leu Phe Asn Leu Leu Glu
180 185 190

Phe Arg Arg Leu Val Leu Asn Tyr Lys Pro Pro Ser Asn Ala Gln Asp
195 200 205

Leu Pro Arg Asn Gln Lys Glu His Arg Asn Leu Pro Phe Met Arg Glu
210 215 220

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Arg Lys Tyr Val
225 230 235 240

Asp Pro Ser Arg Ala Val Glu Ile Leu Lys Asp Ala Phe Lys Ser Asn
245 250 255

Asp Ser Gln Gln Gln Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp
260 265 270

Trp Leu Glu Asp Ala Phe Gln Met Lys Ala Glu Glu Glu Thr Asp Glu
275 280 285

Glu Lys Pro Lys Asn Pro Met Val Glu Leu Phe Tyr Gly Arg Phe Leu
290 295 300

Ala Val Gly Val Leu Glu Gly Lys Lys Phe Glu Asn Thr Glu Met Phe
305 310 315 320

Gly Gln Tyr Pro Leu Gln Val Asn Gly Phe Lys Asp Leu His Glu Cys
325 330 335

Leu Glu Ala Ala Met Ile Glu Gly Glu Ile Glu Ser Leu His Ser Glu
340 345 350

Asn Ser Gly Lys Ser Gly Gln Glu His Trp Phe Thr Glu Leu Pro Pro
355 360 365

Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gln Ala Leu Gly
370 375 380

Arg Pro Glu Lys Ile His Asn Lys Leu Glu Phe Pro Gln Val Leu Tyr
385 390 395 400

Leu Asp Arg Tyr Met His Arg Asn Arg Glu Ile Thr Arg Ile Lys Arg
405 410 415

Glu Glu Ile Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gln Gln Arg
420 425 430

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro Leu
435 440 445

Val Asp Val Leu Gln Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro Val
450 455 460

Cys Thr Ser Pro Val Asp Asp Ile Asp Ala Ser Ser Pro Pro Ser Gly
465 470 475 480

Ser Ile Pro Ser Gln Thr Leu Pro Ser Thr Thr Glu Gln Gln Gly Ala
485 490 495

Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala Ile
500 505 510

Ser Ser Arg Ser Val Ile His Lys Pro Phe Thr Gln Ser Arg Ile Pro
515 520 525

Pro Asp Leu Pro Met His Pro Ala Pro Arg His Ile Thr Glu Glu Glu
530 535 540

Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu Ile Glu
545 550 555 560

Asn Asp Thr Arg Asp Leu Gln Glu Ser Ile Ser Arg Ile His Arg Thr
565 570 575

Ile Glu Leu Met Tyr Ser Asp Lys Ser Met Ile Gln Val Pro Tyr Arg
580 585 590

Leu His Ala Val Leu Val His Glu Gly Gln Ala Asn Ala Gly His Tyr
595 600 605



Trp Ala Tyr Ile Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr Asn
610 615 620

Asp Ile Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp Ser
625 630 635 640

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr Ile Asn
645 650 655

Asp Lys Ala Gln Phe Leu Ile Gln Glu Glu Phe Asn Lys Glu Thr Gly
660 665 670

Gln Pro Leu Val Gly Ile Glu Thr Leu Pro Pro Asp Leu Arg Asp Phe
675 680 685

Val Glu Glu Asp Asn Gln Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp
690 695 700

Ala Gln Leu Ala Gln Lys Ala Leu Gln Glu Lys Leu Leu Ala Ser Gln
705 710 715 720

Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gln Ala Ala Gly
725 730 735

Asp Pro Glu Tyr Leu Glu Gln Pro Ser Arg Ser Asp Phe Ser Lys His
740 745 750

Leu Lys Glu Glu Thr Ile Gln Ile Ile Thr Lys Ala Ser His Glu His
755 760 765

Glu Asp Lys Ser Pro Glu Thr Val Leu Gln Ser Ala Ile Lys Leu Glu
770 775 780

Tyr Ala Arg Leu Val Lys Leu Ala Gln Glu Asp Thr Pro Pro Glu Thr
785 790 795 800

Asp Tyr Arg Leu His His Val Val Val Tyr Phe Ile Gln Asn Gln Ala
805 810 815

Pro Lys Lys Ile Ile Glu Lys Thr Leu Leu Glu Gln Phe Gly Asp Arg
820 825 830

Asn Leu Ser Phe Asp Glu Arg Cys His Asn Ile Met Lys Val Ala Gln
835 840 845

Ala Lys Leu Glu Met Ile Lys Pro Glu Glu Val Asn Leu Glu Glu Tyr
850 855 860

Glu Glu Trp His Gln Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr
865 870 875 880

Leu Ile Ile Gly Leu Glu Asn Phe Gln Arg Glu Ser Tyr Ile Asp Ser
885 890 895

Leu Leu Phe Leu Ile Cys Ala Tyr Gln Asn Asn Lys Glu Leu Leu Ser
900 905 910

Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu Ile Ser His Tyr Arg
915 920 925

Arg Glu Cys Leu Leu Lys Leu Asn Glu Gln Ala Ala Glu Leu Phe Glu
930 935 940

Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu Ile Ile Met Asn Glu
945 950 955 960

Phe Ile Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu Glu
965 970 975

Lys Asp Ile Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser Tyr
980 985 990

Leu Gly Gln Glu Met Glu Pro His Leu Gln Glu Lys Leu Thr Asp Phe
995 1000 1005

Leu Pro Lys Leu Leu Asp Cys Ser Met Glu Ile Lys Ser Phe His Glu
1010 1015 1020

Pro Pro Lys Leu Pro Ser Tyr Ser Thr His Glu Leu Cys Glu Arg Phe
1025 1030 1035 1040

Ala Arg Ile Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg
1045 1050 1055



<210> 2 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Gly Leu Lys Asn Val Gly Asn Thr Cys Trp Phe Ser Ala Val Ile Gln
1 5 10 15

Ser Leu 



<210> 3 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Gln Gln Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp Trp Leu Glu
1 5 10 15

Asp 



<210> 4 
<211> 21 
<212> PRT 

<213> Homo sapiens 

<400> 4 

Tyr Arg Leu His Ala Val Leu Val His Glu Gly Gln Ala Asn Ala Gly
1 5 10 15

His Tyr Trp Ala Tyr 
20 



<210> 5 

<211> 1087 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Thr Val Glu Gln Asn Val Leu Gln Gln Ser Ala Ala Gln Lys His
1 5 10 15

Gln Gln Thr Phe Leu Asn Gln Leu Arg Glu Ile Thr Gly Ile Asn Asp
20 25 30

Thr Gln Ile Leu Gln Gln Ala Leu Lys Asp Ser Asn Gly Asn Leu Glu
35 40 45

Leu Ala Val Ala Phe Leu Thr Ala Lys Asn Ala Lys Thr Pro Gln Gln
50 55 60

Glu Glu Thr Thr Tyr Tyr Gln Thr Ala Leu Pro Gly Asn Asp Arg Tyr
65 70 75 80



Ile Ser Val Gly Ser Gln Ala Asp Thr Asn Val Ile Asp Leu Thr Gly
85 90 95

Asp Asp Lys Asp Asp Leu Gln Arg Ala Ile Ala Leu Ser Leu Ala Glu
100 105 110

Ser Asn Arg Ala Phe Arg Glu Thr Gly Ile Thr Asp Glu Glu Gln Ala
115 120 125

Ile Ser Arg Val Leu Glu Ala Ser Ile Ala Glu Asn Lys Ala Cys Leu
130 135 140

Lys Arg Thr Pro Thr Glu Val Trp Arg Asp Ser Arg Asn Pro Tyr Asp
145 150 155 160

Arg Lys Arg Gln Asp Lys Ala Pro Val Gly Leu Lys Asn Val Gly Asn
165 170 175

Thr Cys Trp Phe Ser Ala Val Ile Gln Ser Leu Phe Asn Leu Leu Glu
180 185 190

Phe Arg Arg Leu Val Leu Asn Tyr Lys Pro Pro Ser Asn Ala Gln Asp
195 200 205

Leu Pro Arg Asn Gln Lys Glu His Arg Asn Leu Pro Phe Met Arg Glu
210 215 220

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Arg Lys Tyr Val
225 230 235 240

Asp Pro Ser Arg Ala Val Glu Ile Leu Lys Asp Ala Phe Lys Ser Asn
245 250 255

Asp Ser Gln Gln Gln Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp
260 265 270

Trp Leu Glu Asp Ala Phe Gln Met Lys Ala Glu Glu Glu Thr Asp Glu
275 280 285

Glu Lys Pro Lys Asn Pro Met Val Glu Leu Phe Tyr Gly Arg Phe Leu
290 295 300

Ala Val Gly Val Leu Glu Gly Lys Lys Phe Glu Asn Thr Glu Met Phe
305 310 315 320

Gly Gln Tyr Pro Leu Gln Val Asn Gly Phe Lys Asp Leu His Glu Cys
325 330 335

Leu Glu Ala Ala Met Ile Glu Gly Glu Ile Glu Ser Leu His Ser Glu
340 345 350

Asn Ser Gly Lys Ser Gly Gln Glu His Trp Phe Thr Glu Leu Pro Pro
355 360 365

Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gln Ala Leu Gly
370 375 380

Arg Pro Glu Lys Ile His Asn Lys Leu Glu Phe Pro Gln Val Leu Tyr
385 390 395 400

Leu Asp Arg Tyr Met His Arg Asn Arg Glu Ile Thr Arg Ile Lys Arg
405 410 415

Glu Glu Ile Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gln Gln Arg
420 425 430

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro Leu
435 440 445

Val Asp Val Leu Gln Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro Val
450 455 460

Cys Thr Ser Pro Val Asp Asp Ile Asp Ala Ser Ser Pro Pro Ser Gly
465 470 475 480

Ser Ile Pro Ser Gln Thr Leu Pro Ser Thr Thr Glu Gln Gln Gly Ala
485 490 495

Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala Ile
500 505 510

Ser Ser Arg Ser Val Ile His Lys Pro Phe Thr Gln Ser Arg Ile Pro
515 520 525

Pro Asp Leu Pro Met His Pro Ala Pro Arg His Ile Thr Glu Glu Lys
530 535 540

Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu Ile Glu
545 550 555 560

Asn Asp Thr Arg Asp Leu Gln Glu Ser Ile Ser Arg Ile His Arg Thr
565 570 575

Ile Glu Leu Met Tyr Ser Asp Lys Ser Met Ile Gln Val Pro Tyr Arg
580 585 590

Leu His Ala Val Leu Val His Glu Gly Gln Ala Asn Ala Gly His Tyr
595 600 605

Trp Ala Tyr Ile Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr Asn
610 615 620

Asp Ile Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp Ser
625 630 635 640

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr Ile Asn
645 650 655

Asp Lys Ala Gln Phe Leu Ile Gln Glu Glu Phe Asn Lys Glu Thr Gly
660 665 670

Gln Pro Leu Val Gly Ile Glu Thr Leu Pro Pro Asp Leu Arg Asp Phe
675 680 685

Val Glu Glu Asp Asn Gln Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp
690 695 700

Ala Gln Leu Ala Gln Lys Ala Leu Gln Glu Lys Leu Leu Ala Ser Gln
705 710 715 720

Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gln Ala Ala Gly
725 730 735

Asp Pro Glu Tyr Leu Glu Gln Pro Ser Arg Ser Asp Phe Ser Lys His
740 745 750

Leu Lys Glu Glu Thr Ile Gln Ile Ile Thr Lys Ala Ser His Glu His
755 760 765

Glu Asp Lys Ser Pro Glu Thr Val Leu Gln Ser Ile Met Met Thr Pro
770 775 780

Asn Met Gln Gly Ile Ile Met Ala Ile Gly Lys Ser Arg Ser Val Tyr
785 790 795 800

Asp Arg Cys Gly Pro Glu Ala Gly Phe Phe Lys Ala Ile Lys Leu Glu
805 810 815

Tyr Ala Arg Leu Val Lys Leu Ala Gln Glu Asp Thr Pro Pro Glu Thr
820 825 830

Asp Tyr Arg Leu His His Val Val Val Tyr Phe Ile Gln Asn Gln Ala
835 840 845

Pro Lys Lys Ile Ile Glu Lys Thr Leu Leu Glu Gln Phe Gly Asp Arg
850 855 860

Asn Leu Ser Phe Asp Glu Arg Cys His Asn Ile Met Lys Val Ala Gln
865 870 875 880

Ala Lys Leu Glu Met Ile Lys Pro Glu Glu Val Asn Leu Glu Glu Tyr
885 890 895

Glu Glu Trp His Gln Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr
900 905 910

Leu Ile Ile Gly Leu Glu Asn Phe Gln Arg Glu Ser Tyr Ile Asp Ser
915 920 925

Leu Leu Phe Leu Ile Cys Ala Tyr Gln Asn Asn Lys Glu Leu Leu Ser
930 935 940

Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu Ile Ser His Tyr Arg
945 950 955 960

Arg Glu Cys Leu Leu Lys Leu Asn Glu Gln Ala Ala Glu Leu Phe Glu
965 970 975

Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu Ile Ile Met Asn Glu
980 985 990

Phe Ile Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu Glu
995 1000 1005

Lys Asp Ile Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser Tyr
1010 1015 1020

Leu Gly Gln Glu Met Glu Pro His Leu Gln Glu Lys Leu Thr Asp Phe
1025 1030 1035 1040

Leu Pro Lys Leu Leu Asp Cys Ser Met Glu Ile Lys Ser Phe His Glu
1045 1050 1055

Pro Pro Lys Leu Pro Ser Tyr Ser Thr His Glu Leu Cys Glu Arg Phe
1060 1065 1070

Ala Arg Ile Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg
1075 1080 1085



<210> 6

<211> 3803

<212> DNA 

<213> Homo sapiens 



<400> 6 



.=.9tC5,c5 tt.c,cc,cc t,cc.,c,gt ,„c,cg„c .tcocttoc 60 

c,cct„ct, ,c,„„c,c tgtcctccc. ,<,„,„c,c ,„,ctccc. g^alclc L 120 

~ =: ~:: ~' '~ "° 

:~ "= = - 

t;,o^^. aggatagtaa tggaaacttg gaattagcag tggctttcct- 360 

tactgcgaag aatgctaaga cccctcaacA rrrr^^^, yy-i-ttccc 360 

tr^r-r . cccctcagca ggaggagaca acttactacc aaacaacact 470 

= = =: := :::::::::: ~: 

cagggcattc agggagactg gaataactga tqaaaaacaa ar-Lt. =<=g3atcaaa 540 

agccagcata gcagagaata aaacatct!t ^^^^""^ gccattagca gagttcttga 600 
ttcccgaaac LttaLata caaa ^^^^^^^^^ cctacagaag tttggaggga 660 

~ ~ :::::::::: = :~ -r- - 

a*.^,, *. am^dgttgaa attcttaagg atqctttcaa cxcn 

<=l,l.U IZZT, IZT", <:...,.=ccc 1080 

t,.«,cct, ,..,c!,c.. !^.*!':": "5=t=t,c. i^oo 

taagagggaa gagatcaaga ^a!:;!:' t aL':"'' ""'^'"^ "^"^^^^^ '''' 
aagatattta agctatgg!t LggLcc^ acaat. ' ^'^"^"^'^ aaaggctaga 1500 
tgcattggaa tttgcctla .ZtTZ ZTZl T.lllT' """^^^^ ^^^^ 
tagttcccca cctagtggtt ccataccaL cctgttgacg atattgacgc 1620 

gggagcccta tctJa!!ac " acagacatta ccaagcacaa cagaacaaca 1680 

:r:a:ca::: a"::::::: a"::::::: ;:™ "r"^='^ "^"^-^ 

tccggcacca aggcacataa cggagg g a""!! L""^'"' 
ctggaggaca gaaatagaaa atgaLcca^ agatttaca! ^tttacatcg I860 

tcgaacaatt gaattaatgt acLtgaca! a!ctat ^ ^'^^^"^^^ ccagaatcca 1930 
tgccgtttta gttcacgaag gccaagctaa taT atcgattaca 1980 

tcatcgtgaa agcagatgga tga L I ZTTI ^^"^^^""^ ^-^""^ga 20.0 
agagctagtg agggactctt ttgg a ZZTZl 'T^^'^' "'"'^^^^ 

cataaatgat aaggcacag. J.llLl a ! aa^a^"":: L"""'" 

ccttgttggt atagaaacat taccaccgga tLgLgat ttlall =*^^3^"9cc 2220 

:= :~ ~ - 
~ ~ :~ -
agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 2520
aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 2580
agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 2640
ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 2700
gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 2760
aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 2820
cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 2880
agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 2940
cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 3000
attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 3060
tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 3120
ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 3180
tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 3240
ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca 3300
tgaactctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg 3360
aagataaact gcacactttc cctgaacaca ctgtataaac tctttttagt tcttaaccct 3420
tgccttcctg tcacagggtt tgcttgttgc tgctatagtt tttaactttt ttttatttta 3480
ataactgcaa aagacaaaat gactatacag actttagtca gactgcagac aataaagctg 3540
aaaatcgcat ggcgctcaga cattttaacc ggaactgatg tataatcaca aatctaattg 3600
attttattat ggcaaaacta tgcttttgcc accttcctgt tgcagtatta ctttgctttt 3660
atcttttctt tctcaacagc tttccattca gtctggatcc ttccatgact acagccattt 3720
aagtgttcag cactgtgtac gatacataat atttggtagc ttgtaaatga aataaagaat 3780
aaagttttat ttatggctac cta 3803



<210> 7 

<211> 3169 

<212> DNA 

<213> Homo sapiens 

<400> 7 

catgaccgtg gagcagaacg tgctgcagca 
tttgaatcaa ctgagagaaa ttacggggat 
gaaggatagt aatggaaact tggaattagc 
gacccctcag caggaggaga caacttacta 
catcagtgtg ggaagccaag cagatacaaa 
tgatcttcag agagcaattg ccttgagttt 
tggaataact gatgaggaac aagccattag 
taaagcatgt ttgaagagga cacctacaga 
tagaaaaaga caggacaaag ctcccgttgg 
tagtgctgtt attcagtcat tatttaatct 
caagcctcca tcaaatgctc aagatttacc 
ttttatgcgt gagctgaggt atctatttgc 
tgatccatca agagcagttg aaattcttaa 
gcaagatgtg agtgagttta cacacaaatt 
gaaagctgaa gaggagacgg atgaagagaa 
tggcagattc ctggctgtgg gagtacttga 
tggtcagtac ccacttcagg tcaatgggtt 
